An evolutionary correlation-aware feature selection method for classification problems

نویسندگان

چکیده

As global search techniques, population-based optimization algorithms have provided promising results in feature selection (FS) problems. However, their major challenge is high time complexity associated with the exploration of a large space and consequently number fitness function evaluations. Moreover, interaction between features another key issue FS problems, directly affecting classification performance through selecting correlated features. In this paper, an estimation distribution algorithm (EDA)-based method proposed three important contributions. Firstly, as extension EDA, each iteration generates only two individuals competing based on function, evolving during using our update procedure. Secondly, we provide guiding technique to determine be selected for iteration. result, final solution would optimized evolution process. These lead increasing convergence speed algorithm. Thirdly, main contribution addition considering importance alone, can consider features, being able deal complementary increase performance. To do this, conditional probability scheme that considers joint The introduced probabilities successfully detect Experimental synthetic dataset proved approach facing these types Furthermore, 13 real-world datasets obtained from UCI repository showed superiority comparison some state-of-the-art approaches. evaluate effectiveness subset, support vector machines are used classifier. efficiency analysis experimental non-parametric statistical tests had significant advantages other

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Correlation–based Feature Selection Strategy in Classification Problems

In classification problems, the issue of high dimensionality, of data is often considered important. To lower data dimensionality, feature selection methods are often employed. To select a set of features that will span a representation space that is as good as possible for the classification task, one must take into consideration possible interdependencies between the features. As a trade-off ...

متن کامل

A Classification Method for E-mail Spam Using a Hybrid Approach for Feature Selection Optimization

Spam is an unwanted email that is harmful to communications around the world. Spam leads to a growing problem in a personal email, so it would be essential to detect it. Machine learning is very useful to solve this problem as it shows good results in order to learn all the requisite patterns for classification due to its adaptive existence. Nonetheless, in spam detection, there are a large num...

متن کامل

An Evolutionary Feature Selection Algorithm for Classification of Human Activities

This paper proposes an evolutionary feature selection algorithm to classify human activities. Feature selection is one of the key issues in machine learning, along with classification when some parts of features are not available or have redundant information. It enhances learning accuracy by selecting essential features and eliminating nonessential features. In the proposed algorithm, a featur...

متن کامل

An Improved K-Nearest Neighbor with Crow Search Algorithm for Feature Selection in Text Documents Classification

The Internet provides easy access to a kind of library resources. However, classification of documents from a large amount of data is still an issue and demands time and energy to find certain documents. Classification of similar documents in specific classes of data can reduce the time for searching the required data, particularly text documents. This is further facilitated by using Artificial...

متن کامل

An Improved Flower Pollination Algorithm with AdaBoost Algorithm for Feature Selection in Text Documents Classification

In recent years, production of text documents has seen an exponential growth, which is the reason why their proper classification seems necessary for better access. One of the main problems of classifying text documents is working in high-dimensional feature space. Feature Selection (FS) is one of the ways to reduce the number of text attributes. So, working with a great bulk of the feature spa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Swarm and evolutionary computation

سال: 2022

ISSN: ['2210-6502', '2210-6510']

DOI: https://doi.org/10.1016/j.swevo.2022.101165